Online Estimating The Top k Nodes Of A Network
نویسندگان
چکیده
The goal of this paper is to estimate the top k central nodes in a network through parsimonious sampling, in an online fashion. We consider three centrality metrics: degree, betweenness, and closeness centrality. We identify and investigate through simulations the contributions of two sources of error in finding central nodes: (1) sampling (collection) error and (2) identification error. Sampling error occurs when a node in the top k most central nodes is not sampled by the sampling algorithm. Identification error occurs when a sampled top k node is not identified as such. We observe that among the analyzed traces, random walks yield low sampling error. Using degree centrality as an alias for identifying nodes with high betweenness and closeness centrality, we then show that degree information collected while sampling the network can be used to reduce identification errors. As a consequence of these two observations, a random walk using degree to identify the top k betweenness and closeness central nodes performs better, in the analyzed traces, for small values of sampled fraction of nodes (1-5%), than more complex strategies recently proposed in the literature.
منابع مشابه
Markov Chain Anticipation for the Online Traveling Salesman Problem by Simulated Annealing Algorithm
The arc costs are assumed to be online parameters of the network and decisions should be made while the costs of arcs are not known. The policies determine the permitted nodes and arcs to traverse and they are generally defined according to the departure nodes of the current policy nodes. In on-line created tours arc costs are not available for decision makers. The on-line traversed nodes are f...
متن کاملComparison and evaluation of the performance of data-driven models for estimating suspended sediment downstream of Doroodzan Dam
Dams control most of the sediment entering the reservoir by creating static environments. However, sediment leaving the dam depends on various factors such as dam management method, inlet sediment, water height in the reservoir, the shape of the reservoir, and discharge flow. In this research, the amount of suspended sediment of Doroodzan Dam based on a statistical period of 25 years has been i...
متن کاملEstimation of Network Reliability for a Fully Connected Network with Unreliable Nodes and Unreliable Edges using Neuro Optimization
In this paper it is tried to estimate the reliability of a fully connected network of some unreliable nodes and unreliable connections (edges) between them. The proliferation of electronic messaging has been witnessed during the last few years. The acute problem of node failure and connection failure is frequently encountered in communication through various types of networks. We know that a ne...
متن کاملThe Influence of Location on Nodes’ Centrality in Location-Based Social Networks
Nowadays, due to the widespread use of social networks, they can be used as a convenient, low-cost, and affordable tool for disseminating all kinds of information and data among the massive users of these networks. Issues such as marketing for new products, informing the public in critical situations, and disseminating medical and technological innovations are topics that have been considered b...
متن کاملA novel sleep/wakeup power management in wireless sensor network: A Fuzzy TOPSIS approach
The wireless sensor network (WSN) is typically comprised many tiny nodes equipped with processors, sender/receiver antenna and limited battery in which it is impossible or not economic to recharge. Meanwhile, network lifespan is one of the most critical issues because of limited and not renewal used battery in WSN. Several mechanisms have been proposed to prolong network lifespan such as LEACH,...
متن کامل